Learning Effective Navigational Strategies for Active Monocular Simultaneous Localization and Mapping

نویسندگان

  • Vignesh Prasad
  • K. Madhava Krishna
  • Ravindran Balaraman
  • Praveen Paruchuri
چکیده

Simultaneous Localization and Mapping (SLAM) refers to the problem of mapping an unknown environment that the robot is operating in and localizing itself in the unknown environment at the same time. Out of the various methods of performing SLAM, using a single monocular camera as the sole sensory input is highly preferred due to its simplicity and low power consumption. Range sensors such as laser range finders, depth cameras etc require much more power to operate and performing SLAM with them is more computationally intensive as compared to SLAM with a single camera. However, when compared to trajectory planning methods using depth-based SLAM, Monocular SLAM in loop does need additional considerations. One main reason being that for a robust optimization of the map and robot trajectory, using Bundle Adjustment (BA) in the case of most monocular SLAM methods, the SLAM system needs to scan the area for a reasonable duration to gather more information about the area to improve the map and pose estimates. Additionally, due to the way monocular SLAM methods work, they do not tolerate large camera rotations between successive views and tend to breakdown. Other reasons for Monocular SLAM failure include ambiguities in decomposition of the Essential Matrix, feature-sparse scenes and more layers of non linear optimization apart from BA. Learning a complex task such as low-level robot manoeuvres while preventing failure of monocular SLAM is a challenging problem for both robots and humans. The data-driven identification of basic motion strategies in preventing monocular SLAM failure is a largely unexplored problem. In this thesis, a computational model is devised for representing and inferring strategies for the problem, formulated as a Markov Decision Process (MDP), where the reward function models the goal of the task as well as information about the strategy. Reinforcement Learning (RL) is used with an intuitive, handcrafted reward function to generates fail safe trajectories wherein the SLAM generated outputs (scene structure and camera motion) do not deviate largely from their true values. This model is expanded on by treating it as an expert and try to learn an underlying true reward function for the given task at hand using Inverse Reinforcement Learning (IRL). Quintessentially, the framework successfully learns the otherwise complex relation between motor actions and perceptual inputs and uses this knowledge to generate trajectories that do not cause failure of SLAM. It also learns how a few chosen parameters affect the task at hand. This complex relation is almost intractable to capture in an obvious mathematical formulation. The framework allows one to identify the way in which a few chosen parameters affect the quality of monocular SLAM estimates. The estimated reward function was able to capture expert demonstration information and the inherent expert

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Deeply Supervised Visual Descriptors for Dense Monocular Reconstruction

Visual SLAM (Simultaneous Localization and Mapping) methods typically rely on handcrafted visual features or raw RGB values for establishing correspondences between images. These features, while suitable for sparse mapping, often lead to ambiguous matches at texture-less regions when performing dense reconstruction due to the aperture problem. In this work, we explore the use of learned feature...

متن کامل

Learning monocular visual odometry with dense 3D mapping from dense 3D flow

This paper introduces a fully deep learning approach to monocular SLAM, which can perform simultaneous localization using a neural network for learning visual odometry (L-VO) and dense 3D mapping. Dense 2D flow and a depth image are generated from monocular images by sub-networks, which are then used by a 3D flow associated layer in the L-VO network to generate dense 3D flow. Given this 3D flow...

متن کامل

Map-merging in Multi-robot Simultaneous Localization and Mapping Process Using Two Heterogeneous Ground Robots

In this article, a fast and reliable map-merging algorithm is proposed to produce a global two dimensional map of an indoor environment in a multi-robot simultaneous localization and mapping (SLAM) process. In SLAM process, to find its way in this environment, a robot should be able to determine its position relative to a map formed from its observations. To solve this complex problem, simultan...

متن کامل

Integrating Monocular Vision and Odometry for SLAM

This paper presents an approach to Simultaneous Localization and Mapping (SLAM) based on monocular vision. Standard multiple-view vision techniques are used to estimate robot motion and scene structure, which are then integrated with minimal odometric information and used to build a global environment map. Preliminary experimental results are also presented and discussed. Key-Words: Robot local...

متن کامل

Novel Rao-Blackwellized Particle Filter for Mobile Robot SLAM Using Monocular Vision

This paper presents the novel Rao-Blackwellised particle filter (RBPF) for mobile robot simultaneous localization and mapping (SLAM) using monocular vision. The particle filter is combined with unscented Kalman filter (UKF) to extending the path posterior by sampling new poses that integrate the current observation which drastically reduces the uncertainty about the robot pose. The landmark pos...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017